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ABSTRACT 

We have analyzed the clustering of C IV and Mg II absorption-line systems on 
comoving scales r from 1 to 16 h~ l Mpc, using an extensive catalog of heavy-element 
QSO absorbers with mean redshift (z)c iv = 2.2 and (^)Mg n = 0.9. We find that, 
for the C iv sample as a whole, the absorber line-of-sight correlation function is 
well-fit by a power law of the form £ aa (r) = (ro/r) 7 , with maximum-likelihood values 
of 7 = 1.75 io'70 and comoving ro = 3.4 ^'q Mpc ( qo = 0.5). The clustering of 
absorbers at high redshift is thus of a form that is consistent with that found for galaxies 
and clusters at low redshift, and of amplitude such that absorbers are correlated on 
scales of galaxy clusters. We also trace the evolution of the mean amplitude £o(v) 
of the correlation function, as a function of redshift, from z = 3 to z = 0.9. We 
find that, when parametrized in the conventional manner as £o(a) oc (1 + z)M 3 + e )+7 ; 
the amplitude grows with decreasing redshift, with maximum-likelihood value for 
the evolutionary parameter of e = 2.05 ± 1.0 (qo = 0.5). When extrapolated to zero 
redshift, the amplitude of the correlation function implies that the correlation length 
ro = 30 h -1 Mpc (qo = 0.5) . This suggests that strong C IV and Mg II absorbers, 
on megaparsec scales, are biased tracers of the higher-density regions of space, and 
that agglomerations of strong absorbers along a line of sight are indicators of clusters 
and superclusters. This is supported by recent observations of “Lyman break” galaxies. 

The growth seen in the clustering of absorbers is consistent with gravitationally 
induced growth of perturbations. 
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1. Introduction 


The exact nature of QSO heavy-element absorption-line systems is not yet well understood. 
Nevertheless, there is growing evidence suggesting that these absorbers are associated with galaxy 
halos and disks. Deep images of fields around QSOs with absorbers in their spectra have been 
made, and galaxies have been found at the redshifts and in the vicinities of absorbers (see, e.g., 
Steidel, Dickenson, fc Persson 1994 ; Steidel et al. 1997|) . The distances between galaxies and the 
associated absorbers are inferred to be of order 100 kpc, or the size of an extended galaxy halo 
( Churchill, Steidel, fe Vogt 1996| ). This association has also been inferred for a substantial fraction 
of the Lya absorbers (Lanzetta et al. 1995; Le Brun, Bergeron, &; Boisse 1996| ), but they are not 
the subject of this work. 

One approach in relating absorbers to galaxies is to compare their respective clustering 
properties. In a previous paper, Quashnock, Vanden Berk, & York (1996, hereafter QVY) analyzed 
line-of-sight correlations of C IV and Mg ii absorption-line systems on large scales, using an 
extensive catalog of 2200 heavy-element absorption-line systems in over 500 QSO spectra ( |Vanden 
Berk et al. 1998| ). (More details about the catalog can be found in an earlier version of the catalog 
||York et al. 199 If] , as well as in QVY.) These authors found that the absorbers are clustered, like 
galaxies, on comoving scales of 50 to 100 h~ l Mpc (qo = 0.5). 

This high-redshift (z from 1.5 to 3.5) superclustering of the absorbers is on the same comoving 
scale as that traced by the voids and walls of galaxy redshift surveys of the local universe (see, 
e.g., Kirshner et al. 1981 ; Geller fe Huchra 1989| ; Landy et al. 1996|) . It thus appears that the 
absorbers are tracing the same structure as that traced by galaxies on very large scales, and that 
they are effective probes of very large scale structure in the universe (see, e.g., |Crotts et al. 1985 


[Tytler, Sandoval, fe Fan 1993 ). The authors of QVY argued that this super clustering is generic 
(they found potential superclusters along 7 different lines of sight); indeed, such superclustering 
has been detected by many studies (see, e.g., |Heisler, Hogan, fe White 1989 ; Dinshaw &; Impey 


1996[ [Williger et al. 199Gp and refs, in QVY) 


In this paper, we extend the QVY analysis of line-of-sight correlations of C iv and Mg n 
absorption-line systems to smaller comoving scales — from 1 to 16 /t _1 Mpc, i.e., galaxy cluster 
scales, corresponding to line-of-sight velocity differences Av ~ 200 - 3000 km s -1 at (z )c iv = 2.2 
— and relate the small-scale clustering of absorbers to galaxy clustering in general. 


Early studies (|Young, Sargent, fc Boksenberg 1982| ; Sargent, Boksenberg, &: Steidel 1988|) , 
with approximately 100 km s _1 spectral resolution, found that C IV absorbers cluster significantly 
on these scales, with a correlation function £ ~ 5 to 10 for line-of-sight velocity differences 
Av = 200 - 600 km s . On smaller scales (Av < 200 km s -1 ), internal motions of the gas 
comprising the absorption systems are expected to dominate (Heisler et al. 1989), if the absorbing 
gas is moving in gravitational orbits inside galactic halos with typical cloud velocity dispersions 
~ 200 km s^ 1 ) Davis fe Peebles 1983 ). Indeed, Petitjean & Bergeron (1994), using higher spectral 


resolution data, found that the C IV line-of-sight correlation function can be described by two 
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Gaussian components, with velocity widths a v = 109 and 525 km s , with the latter component 
accounting for almost all (93%) of the signal in £ for Av = 200 - 600 km s . A similar result, 
with even higher resolution Keck High Resolution Echelle Spectrometer (HIRES) spectra, has 
been found by Rauch et al. (1996). 


A natural interpretation (see, e.g., Sargent et al. 1988) of these results is that the low-velocity 
component (Av < 200 km s -1 ) is due to gas motion inside the absorbers, with the higher-velocity 
component (Av > 200 km s -1 ) being due to actual spatial clustering of the absorbers on galaxy 
cluster scales (several Mpc). However, because the line-of-sight velocity dispersion of typical 
galaxy clusters ranges from 300 - 1400 km s -1 ( ^abludoff et al. 1993 ; Dinshaw fe Impey 1996 ), the 
higher-velocity component may represent, at least in part, motion of absorbers inside a galaxy 
cluster, rather than true spatial clustering ( Shi 1995 ). This has been argued by Crotts, Buries, & 
Tytler (1997), who explore the spatial clustering of C IV systems along adjacent lines of sight, and 
claim that it is significantly weaker than clustering along a line of sight. 


We return to the question of interpretation of the clustering of heavy-element absorption 
lines in § 6. Our interest here is in the form and evolution of that clustering on comoving scales of 
1 /i -1 Mpc and greater (Au > 180 km s -1 ), corresponding to the higher-velocity component. We 
have taken steps (§ 3) to insure that there is no aliasing of power from the low-velocity component. 

The outline of the paper is as follows. We present our likelihood method of estimating the 
correlation function from the data in § 2. We describe our data selection criteria in § 3. We then 
discuss the form of the correlation function in § 4. We also present results on the evolution of 
the clustering, by calculating its redshift dependence in § 5. We discuss some of the implications 
of these results in § 6, and summarize them in § 7. In the Appendix, we explicitly calculate the 
likelihood as a function of data and model parameters. 


2. Likelihood Method of Estimating the Absorber Correlation Function 

Here we outline our likelihood method of estimating the line-of-sight correlation function, 
£ aa (r, ")• (We defer a detailed calculation of the likelihood function to the Appendix.) Unless 
otherwise noted, we take go = 0.5 and A = 0. We follow the usual convention and take the Hubble 
constant to be 100 h km s -1 Mpc -1 . 

The correlation function £ aa (r, z) relates the observed number of absorber pairs, separated 
by comoving distance r at redshift z, with the expected number of pairs, of the same separation 
and redshift, from an unclustered absorber population. The data D consist of the observed set 
of absorber pairs i, each with comoving separation n at redshift Zj. Normally, in calculating the 
correlation function, these data are binned, and £ aa (f, z) is obtained by dividing the observed 
number of pairs in a bin by the number expected in the bin. In our likelihood approach, no such 
binning is necessary. 










-4- 


Consider a given model for £ aa (r, z). Each such model is described by a set of parameters M, 
and we wish to find the parameter values which best describe the observed data D. To do so, we 
must maximize the likelihood L , (calculated in the Appendix, eq. [A4]), which is the probability of 
the data D, given the model parameters M for the correlation function £ aa (r, z): C = P(D\M). 
Given a prior distribution, P(M ), for the model parameters (usually obtained from physical or 
symmetry arguments), we can use Bayes’ Theorem to then calculate the posterior probability 
distribution, P(M\D ), for the model parameters M, given the data D (see Loredo 1992| for a 
compendium of astrophysical applications of Bayesian inference): 


P(M\D) = P(D\M) x P{M)/ / P(D\M)P(M)dM 


(1) 


The parameter values which maximize the likelihood are those we take as the “best-fit” values, 
and credible regions for the parameters around the maximum-likelihood values are obtained from 
equation (1), by requiring that a fiducial amount of posterior probability (such as 68.3% or 95.5%, 
nominal 1 and 2 a amounts) be contained inside the credible region. 

In this paper, we consider three models for the correlation function. The first is a simple 
power law, with comoving clustering length ro and power-law index 7 : 

£aa (r, 2 ) = (r 0 /r) 7 . (2) 


The second is also a power law, but with an amplitude that evolves with decreasing redshift as a 
power of the expansion factor: 


£aa (r,z) = (r 0 /r) 


1 + z 
1 + Zo 


-(3+e)+7 


( 3 ) 


Again r$ is the comoving clustering length (at a fixed redshift zq), e is the conventional evolutionary 
parameter flGroth fc Peebles 1977 ; Efstathiou et al. 199 1| ), and 7 is a power-law index that does 
not evolve with redshift. The third model is that of a power-law correlation function evolving 
according to linear theory of gravitational instability (Peebles 1980|, 1993): 


£aa (r,z) = (r 0 /r)' Y g 2 (z,fl 0 ,^) , 


( 4 ) 


where g(z, 12q, A) is the growth factor. When Oq = 1 and A = 0, g oc (1 + z) 1 


3. Selection of the Data from the Absorber Catalog 

We have drawn our C iv and Mg 11 data sample from the heterogeneous catalog of Vanden 
Berk et al. (1998) — with the aim of constructing as homogeneous a dataset as possible — by 
accounting for the spectral wavelength coverage, spectral sensitivity, and spectral resolution of 
each line of sight. The selection criteria we used to produce the final set of absorption systems are 
similar to those in QVY, and we outine them here. 
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First, the Lya forest region, where heavy-element line identifications are often unreliable, was 
excluded in each spectrum. The so-called associated region, within 5000 km s^ 1 of each QSO, 
was also excluded, because the origin of the systems in that region may be different than that of 
systems farther removed from the QSO (Foltz et al. 1988). Only systems with at least a C IV 
or Mg II doublet were selected; in that case, the intrinsically stronger doublet component was 
required to have been detected at the 5cr or greater significance level. In addition, a system had 
to have at least one more identified heavy-element line, or have a doublet equivalent width ratio 
that is consistent with atomic physics (within the measurement errors). 

Since we are measuring correlations of absorber pairs with comoving separations of 1 h^ 1 Mpc 
or greater, we selected only those spectra or spectral regions in which the spatial resolution was 
better than this limit. All systems lying within 1 h~ l Mpc of each other, in the same line of 
sight, were combined into a single system by averaging the redshifts and taking the equivalent 
width of the strongest system. This has the effect of minimizing the aliasing of power from scales 
smaller than 1 h~ l Mpc — corresponding to An = 180 km s -1 in the rest frame at ( z)c iv = 2.2 
— where internal motions of the gas comprising the absorption systems are thought to dominate 
(Heisler et al. 1989; and § 1 above). Since 180 km s -1 is 1.6 times as large as the low-velocity 
width ( [Petit jean &; Bergeron 1994 , Fig. 4a), most of the clustering from the low-velocity Gaussian 
component has been removed by our selection procedure. 


Applying all the selection criteria above to the Vanden Berk et al. (1998) catalog leaves a 
data sample consisting of 260 C IV absorbers, drawn from 202 lines of sight, with redshifts ranging 
from 1.2 < 2 < 3.6 and mean redshift (z)c iv = 2.2, and 64 Mg II absorbers, drawn from 278 lines 
of sight, with redshifts ranging from 0.3 < z < 1.6 and mean redshift (z)Mg n = 0.9. 


4. Form of the Correlation Function 

Figure 1 shows the line-of-sight correlation function £ aa (r), f° r the entire sample of C iv 
absorbers (with mean redshift ( z)c iv = 2.2), as a function of absorber comoving separation r 
from 1 to 16 /i _1 Mpc, in 4 octaves. We have restricted our analysis to these scales, because data 
on scales larger than 16 /i -1 Mpc is too scarce to be used unless it is put in very large bins, and 
because we did not want to examine scales smaller than 1 /i _1 Mpc. 

The correlation function is computed by comparing the distribution of real absorber pair 
separations with the expected distribution for an unclustered sample. We calculated the expected 
distribution using Monte Carlo simulations, creating a large number of fake unclustered catalogs, 
using the following “bootstrap” method (see QVY). For each Monte Carlo simulation, the same 
number of absorption systems as in the real sample catalog was chosen for random distribution. 
The redshifts and equivalent widths of the random systems were kept the same as the real 
systems, as were the spectral properties of the observed QSO lines of sight. The systems were then 
randomly placed along the lines of sight, subject to the condition that a system could have been 
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detected in the given line of sight. This method randomizes the absorber placements, while taking 
into account the redshift and equivalent width number density, which remain exactly the same as 
in the real sample. The expected distribution of pair separations was then calculated by averaging 
the pair distribution over 1000 Monte Carlo simulations. The vertical error bars through the data 
points are 1 a errors in the estimator for £ aa ( Peebles 1980 ), and were computed in the same 
fashion as in QVY. 


The first point in Figure 1 shows £ aa ( r ) for r between 1 and 2 h^ 1 Mpc, corresponding to 
180 - 360 km s ^ 1 at (z) c iv = 2.2. We find that £ aa = 5.4 ± 1.7, a value consistent with several 
previous determinations: (1) £ aa = 5.7 ± 0.6 for Av = 200 - 600 km s ^ 1 (sample A2 of Sargent et 
al. 1988); (2) £ aa ~ 6 at 360 km s^ 1 (Fig. 3 of Petitjean & Bergeron 1994); and (3) £ aa ~ 4 at 
300 km s^ 1 (Fig. 4 of Rauch et al. 1996). However, our dataset is large enough to measure £ aa on 
bigger scales and to determine the form of the correlation function more accurately. 


From Figure 1, the absorber line-of-sight correlation function on smaller scales can be 
described by a power law. However, some information has been lost because of the binning of data 
required to make the errors in Figure 1 reasonably small. Accordingly, a standard y 2 -fit to the 
data is not the most powerful method available. 

Instead, we have used the maximum-likelihood method of § 2 (eq. [A4]) to fit a power law 
of the form given in equation (2) to the unbinned C IV data. We find maximum-likelihood values 
for the parameters, obtaining tq = 3.4 h~ l Mpc and 7 = 1.75. This fit is shown in Figure 1, and 
describes the data quite well. 


Figure 2 shows the credible region for these parameters. The cross marks the maximum- 
likelihood location, and the heavy and light contours mark the 1 a and 2 a credible regions, 
respectively. In calculating the credible regions, we have used Bayes’ Theorem (eq. [1]) and taken 
uniform priors P(M) in the power law intercepts to Figure 1, corresponding f] to uniform priors 
P(M ) in the parameters log 2 ro and 7 . 

We condense the information in Figure 2 by showing the marginalized posterior distributions 
for the parameters 7 and ro in Figures 3 and 4, respectively. The vertical lines mark the 1 a 
credible regions, which we find to be 1.05 < 7 < 2.25, and 2.4 h~ 1 Mpc < ro < 4.1 h~ l * Mpc 
(qo = 0.5). Thus, for the C IV sample as a whole, the absorber correlation function is well-fit by 
a power law of the form £ aa (r) = (?’o/r) 7 , with maximum-likelihood values of 7 = 1.75and 
comoving correlation length ro = 3.4^ 1 Mpc (c/o = 0.5). 

Unfortunately, the Mg II sample, with only 64 absorbers (with mean redshift (z)Mg n = 0.9) 
compared to 260 absorbers in the C IV sample, is not large enough to adequately model the form 


2 Since we have no a priori information on the scale of clustering, r 0, nor on its overall amplitude, £ aa (l h Mpc) 

= rj), we take uniform priors in logro and log£ aa (l hr 1 Mpc) = ylogro (the intercepts in Fig. 1 ). Thus the prior 

probability density dP oc d(logro) x dp logro), or, using a Jacobian to change to our two independent variables of 

interest, 7 and ro, dP oc d-y x d(log 2 ro). 
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of the correlation function. The data are, at the 95% confidence level, consistent with all values of 
7 > 1.55. Nevertheless, the sample is large enough to constrain the comoving correlation length, 
if we fix the value of the power-law index at 7 = 1.75. In this case, we find that ro = 8.4 ± 2.0 
h~ l Mpc. This larger value — assuming that C IV clustering is directly comparable to that of 
Mg 11 (see § 5 below) — suggests that the amplitude of the correlation function has grown between 
redshift (z )c iv = 2.2 and redshift (z)Mg n = 0.9. In the following section, we find that this is 
indeed the case, by investigating the evolution of the correlation function, using the entire sample 
of C IV and Mg 11 absorbers. 


5. Evolution of the Correlation Function 


We have investigated the evolution of the clustering of absorbers by dividing the C IV 
absorber sample into three approximately equal redshift sub-samples, and comparing these to the 
Mg 11 sample. Figure 5 shows the mean of the correlation function, £o(z)> averaged over comoving 
scales r from 1 to 16 h~ 1 Mpc, for the low (1.2 < 2 < 2.0), medium (2.0 < 2 < 2.8), and high 
(2.8 < 2 < 3.6) redshift C IV sub-samples, as well as for the Mg 11 sample (0.3 < z < 1.6). The 
amplitude of the correlation function is clearly growing with decreasing redshift. 


We have used both the C iv and Mg 11 samples to study the evolution of heavy-element 
absorber clustering with redshift, since these samples cover almost disjoint redshift intervals. 

For the following reasons, we believe that it is reasonable to do this. (1) Caulet (1989), in a 
high-sensitivity survey of Mg 11 and Mg I lines found in C IV systems, found that 60% to 70% of 
the Mg II population (1 < z < 2) have moderate or strong C IV absorption lines (equivalent width 
> 0.3 A, like the bulk of our sample), the rest having either weak (equivalent width < 0.3 A) or 
undetected lines. Thus, Mg II systems can reasonably be expected to trace a similar clustering 
distribution as stronger C iv systems, and a correlation analysis should yield similar results, within 

using loose 


the errors of £o(^) i n Figure 5. (In the absorption catalog [Vanden Berk et al. 11 
selection criteria, we find that about 67% of the Mg 11 systems have corresponding detected C IV 
lines when they were accessible. However, after applying the stricter selection criteria (§ 3) used 
for the clustering analysis, only 7 Mg il/C IV systems remain. This is not sufficient to directly 
compare the clustering of each species.) (2) The work of Petitjean & Bergeron (1990, 1994) 
suggests that the clustering properties of C IV and Mg 11 may be directly comparable. (3) It seems 
highly unlikely that the distribution of C IV and Mg II systems would be different, in a statistical 
sense, on scales as large as those studied here, i.e., 1 hr 1 Mpc to 16 /r _1 Mpc, corresponding to 
galaxy cluster and intercluster scales. That would imply a large-scale segregation of species. (4) 
The correlation function measured for Mg II systems is consistent with an extrapolation of the 
best-fit to the C IV data alone (see below). 


We have used the maximum-likelihood formalism of § 2 and the Appendix to fit the unbinned 
C iv and Mg 11 data, and describe the evolution of the correlation function, using the evolving 
power law, given by equation (3). We fix zq at the mean C IV redshift of 2.2, so that the parameter 





ro corresponds to the correlation length at that redshift, enabling a direct comparison with the 
results of § 3 for the C iv sample as a whole (see below). Since the data for the Mg n sample, 
as well as for the C iv sample as a whole, are consistent with a constant value of the power-law 


index 7 , and since growth of hierarchical clustering in gravitational instability theory (Peebles 


1980|, 1993) and in numerical simulations occurs with a nearly constant value of 7 , we have fixed 


7 at its maximum-likelihood value of 1.75 in our analysis below, and parametrized the evolution 
of the correlation function in terms of the evolutionary parameter e and the correlation length ro 
at redshift zq = 2 . 2 . 


Figure 6 shows the credible region for these parameters. The heavy and light contours mark 
the 1 u and 2 a credible regions, respectively. The cross marks the maximum-likelihood location, 
which we find to be e = 2.05 and ro = 3.2 h~ l Mpc. This fit is the one shown as a solid line in 
Figure 5. 

In order to check our assumption that Mg 11 and C IV clustering are directly comparable, we 
also show the maximum-likelihood fit to the C iv data alone (shown as a dashed line in Fig. 5), 
which we find to be e = 2.3 and ro = 3.3 h~ l Mpc. The correlation function measured for Mg 11 
systems is clearly consistent with the value extrapolated from C iv at higher redshift. The error 
in the estimate of the evolutionary parameter e is of course larger if we use the C iv data alone 
(e = 2.3± 1.5); hence, we have included the Mg 11 data in our analysis, for the reasons given above. 

We show the marginalized posterior distributions for the parameters ro and e, in Figures 
7 and 8 , respectively. The vertical lines mark the 1 a credible regions, which we find to be 2.6 
h~ l Mpc < ro < 3.8 hr 1 Mpc, and 1.05 < e < 3.05. All these results are with qo = 0.5. With 
qo = 0.1, we expect our estimate of e to decrease from 2.05 to 0.8. 

It is interesting to compare the posterior distribution for ro in Figures 4 and 7. Both have a 
maximum near the same location at ro ~ 3.3 h Mpc, but the distribution in Figure 7 is definitely 
narrower. This is not surprising, since the evolution in the amplitude of the correlation function 
(and hence in the correlation length) has smeared the distribution of ro in Figure 4. There, ro 
represents the correlation length for the C IV sample as a whole (with mean redshift (z)c iv = 2.2), 
whereas in Figure 7 we have allowed for evolution, with ro representing the correlation length 
exactly at zq = 2.2. The fact that the distribution in Figure 7 is narrower is more evidence in 
favor of an evolving correlation function. With e = 2.05 ± 1.0, the no-evolution case (e = —1.25) 
is in fact ruled out at the 3.3 a (99.95 %) confidence level. 


6. Discussion 

We have analyzed the clustering of C iv and Mg 11 absorption-line systems on comoving scales 
r from 1 to 16 h -1 Mpc, using a new likelihood method that does not require the usual binning 
used to compute the correlation function. We find that the form of the line-of-sight correlation 
function is well-described by a power law of the form £ aa (r) = (^oA) 7 ) with maximum-likelihood 
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values for the power-law index of 7 = 1.75 70 - The clustering of absorbers at high redshift is 

thus of a form that is consistent with that found for galaxies and clusters, at low redshift, on 
megaparsec scales (7 = 1.77 ± 0.04 for galaxies [ Davis fe Peebles 1983 1, 7 = 2.1 ± 0.3 for clusters 
[Nichol et al. 19921). It appears that the absorbers are tracing the large-scale structure seen in the 
distribution of galaxies and clusters, and are doing so at high redshift. The finding strengthens 
the case for using absorbers in probing large-scale structure. 


We find that the amplitude of the correlation function, characterized by the correlation length 
?’o, is growing with decreasing redshift. For the entire C iv absorber sample, with mean redshift 
(z)c iv = 2.2, we find that ro = 3.415 'q h~ l Mpc (go = 0.5), whereas for the Mg II sample, with 
mean redshift (^)Mg n = 0.9, we find that ro = 8.4 ± 2.0 /r ” 1 Mpc. The C IV sample is large 
enough to be divided into several sub-samples; by using all the data sets, and parametrizing the 
amplitude of the correlation function in the usual manner as £ 0 ( 2 ) ck (1 + z)”l 3+l 0 +7 , we find that 
the growth is reflected in a value for the evolutionary parameter of e = 2.05 ± 1.0 (go = 0.5). 


13 cm 2 1 


Evidence for a trend of increasing clustering of Lya absorbers [with iV(H 1 ) > 6.3 x 10 
with decreasing redshift has been found by Cristiani et al. (1997). Like Crotts (1989), these 
authors also find a clear trend of increasing Lya absorber clustering with increasing column 
density, and find that an extrapolation to column densities typical of heavy-element systems 
[!V(H 1 ) > 10 16 cm” 2 ] is consistent with the clustering observed for C IV absorbers ( Petitjean fe 


[Bergeron 1994 ; |Songaila &: Cowie 1996 ). Our finding of growth in the clustering of heavy-element 
systems with decreasing redshift supports both a continuity scenario between Lya and heavy- 
element systems ( Crotts 1989| ; Tytler et al. 1995 ; Cowie et al. 1995 ; Cristiani et al. 19~97| ), and the 
common action of gravitational instability. 


Fernandez-Soto et al. (1996) have investigated the correlation function of Lya absorbers [with 
JV(H 1 ) > 3 x 10 14 cm” 2 ] at high redshift, by using corresponding C IV systems as tracers. After 
comparing the observed correlations at high redshift to the galaxy correlation length at present 
(viz., ro ~ 5.5 /i” 1 Mpc), they find that the evolutionary parameter e is anywhere between 1.5 
and 2.8 (approximate 1 a confidence; see Fig. 4 of Fernandez-Soto et al. 1996), consistent with 
our finding. However, their determination, unlike ours, is dependent upon comparing absorber 
clustering in the past with galaxy clustering at present. This is problematic, since the exact 
relationship between the stronger Lya absorbers, presumed to be halos of protogalaxies, and the 
corresponding population of galaxies today is unknown, and different types of galaxies are known 
to cluster with differing strengths. Nevertheless, their determination of Lya absorber clustering 
is broadly consistent with what is expected for galaxies at high redshift, and their estimate of the 
evolutionary parameter e is consistent with ours. 


The growth of the correlation function of heavy-element absorbers, as shown in Figure 5 and 
if extrapolated to zero redshift using the evolving power law (eq. [3]), implies a large value for the 
present-day value of the correlation length: ro = 30l 2 g /i” 1 Mpc (go = 0.5) . This value is several 
times larger than the average value for field galaxies today, viz., ro = 5.4 ± 1.0 h ~ 1 Mpc (pavis fc 
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Peebles 1983), but is comparable to that of clusters, viz., ro = 16.4 ± 4.0 h 1 Mpc flNichol et al. 


1992). The correlation length of galaxies today is known to depend on luminosity and type (Le 
Fevre et al. 199(j|), ranging up to 7.4 h Mpc in the Stronrlo-APM redshift survey (Loveday et al. 


1995) and up to 14.5 h 1 Mpc in the Second Southern Sky Redshift Survey (SSRS2; Benoist et 


al. 1996), for the most luminous early-type galaxies. Comparing all these values of the correlation 
length suggests that, on nregaparsec scales, the strong C IV and Mg II absorbers in the Vanden 
Berk et al. (1998) catalog are tracing the distribution of clusters, or of the most luminous super-L* 
galaxies in those clusters. 

Of course, extrapolating our best fit in Figure 5 to zero redshift is highly uncertain, as 
reflected in the broad range in our estimate of ro today, and is dependent on an assumed power-law 
form for the evolution of the correlation function. This makes a determination of heavy-element 
absorber clustering at low redshift all the more important; however, the sample of low-redshift 


heavy-element absorption-line systems (see, e.g., Bahcall et al. 1996; Vanden Berk 1997) is too 
small at present to be useful in this regard. 

It is not yet clear whether the correlation function shown in Figure 1 represents motion of 
absorbers inside a galaxy cluster ( phi 1995 ; Protts, Buries, fe Tytler 1997 ), rather than true 
spatial clustering, since along a line of sight we cannot distinguish peculiar velocities from the 
Hubble flow. In fact, on the scales considered here, the same ambiguity would arise with galaxies 
themselves. What is clear, however, is that the scale (r o = 3.4 h~ 1 Mpc, corresponding to Av = 
610 km s” 1 at ( z)c iv = 2.2), the form (£ oc r -1 ' 75 ), and the amplitude of the clustering in Figure 1 
all are indicative of an association of strong absorbers with clusters. Furthermore, this clustering 
is growing with time (Fig. 5), and could represent actual growth in spatial clustering (see below) 
or an increase in the velocity of absorber motions inside clusters due to increasing cluster masses. 


If heavy-element absorbers are indicative of galaxy clusters, how do their number densities 
compare? The number of strong heavy-element absorption-line systems per unit redshift along a 
line of sight is dAf/dz ^ 1 (see, e.g., Vanden Berk et al. 1996|) . Taking n and o to be, respectively, 
the characteristic comoving number density and effective cross section of the absorbers, and 
defining dJ\f /dR to be the number of systems per comoving length, we obtain ( |Murdoch et al 
q 0 = 0.5): 

dU dN H 0 


1986 


ns = lM = lh x - (1 + :) ’- 


(5) 


Taking for an upper limit of a a cross section characteristic of clusters having size an Abell radius 


a < 7T r 2 = 7 h 


-2 


Mpc 2 , we find that n > 10 4 h 3 Mpc 3 at z ^ 1 flCrott: 


r a = 1.5 h _1 Mpc, viz 

|1989[) , which is at least twice the density of richness class R > 0 clusters found by Postman et al. 
(1996). This means there must be, on average , several strong heavy-element absorbers per cluster 
in space. 


Then, if the strong clustering seen in the line-of-sight correlation function of these absorbers 
is indicative of clustering on galaxy cluster scales, they must be biased tracers of higher density 
regions of space. This suggests that agglomerations of strong absorbers along a line of sight 
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are indicators of clusters and superclusters (see QVY). This relationship is supported by the 
following. (1) Steidel et al. (1998) have recently discovered a large concentration of 15 “Lyman 
break” galaxies, at redshift 2 = 3.09 ± 0.02, of size 11' by 8 ' on the plane of the sky, corresponding 
to 10 h -1 Mpc by 7 h~ l Mpc (comoving, flo = 1). In studying the spectrum of a backlighting 
QSO (z em = 3.356) that was also discovered in the field, they found a C iv doublet, as well as a 
strong Lya system, at 2 = 3.094, right at the redshift of the concentration. (Two other prominent 
features, albeit of lower significance, were also seen in the redshift histogram of the Lyman break 
galaxies, and both have counterparts in metal line systems.) A similar coherence between the 
large-scale distribution of high-redshift galaxies (z = 2.38) and that of absorbing gas has also been 
reported by Francis et al. (1996). ( 2 ) Giavalisco et al. (1998) have estimated the amplitude of the 
correlation function of the aforementioned Lyman break galaxies at a median redshift z = 3.04, 
and find a correlation length ro = 2.1 ± 0.7 h -1 Mpc (qo = 0.5), a value consistent with that we 
have found for the C iv absorbers in Figure 5 (ro = 2.2 h -1 Mpc at z = 3.04). 

In a detailed three-dimensional numerical investigation of the evolution (from z = 5 to 
z = 2) of structure in the distribution of Lya absorbers, Zhang et al. (1998) have found that the 
richest absorbers [iV(H i) > 10 15 cm -2 ], and, correspondingly, heavy-element systems, tend to 
concentrate in nodules at the intersection of filamentary structures (with comoving separation 
of a few megaparsecs at z = 2) comprising the clustering network. Those nodules appear to 
represent the locus of galaxy clusters. The scale of clustering found in our study of heavy-element 
absorption-line systems (ro = 3.2 /i -1 Mpc at z = 2.2) appears consistent with this picture, and 
suggests that these systems are indeed tracing the richer agglomerations of the clustering network. 
The previous finding by QVY of superclustering of C iv absorbers on 100 h~ l Mpc scales suggests 
the existence of clustering in the nodules as well. 


The strong clustering that we find in the heavy-element absorption-line systems is also 
not surprising, given that most of the sample consists of the strongest systems with relatively 
large equivalent widths (order 0.4 A and greater), and the recent claims ( Cristiani et al. 1997] ; 
D’Odorico et al. 1998| ) of a strong dependence of clustering strength on the column density of 
the systems. The median C iv A1548 rest equivalent width in our sample is (W) = 0.4 A, and 
there are few systems with W < 0.1 A. We have divided the sample in half by equivalent width, 
and computed the mean correlation function £o averaged for comoving scales between 1 /i -1 Mpc 
and 16 /i -1 Mpc. We find that the weaker systems appear to be less clustered than the stronger 
systems: £o(W < 0.4 A) = 0.87 ± 0.45, £o(kF > 0.4 A) = 1.73 ± 0.60; however, the sample size is 
not sufficient to significantly test the dependence of clustering on equivalent width. 


The growth in the correlation function and the value of the evolutionary parameter 
(e = 2.05 ± 1.0) that we find, are consistent with gravitationally induced growth of perturbations. 
For qo = 0.5, from linear theory of gravitational instability ( Peebles 1980| , 1993) £ oc (1 + z) -2 
e = 0.75 if 7 = 1.75. Also, from numerical simulations (Melott 1992; Parlberg et al. 1997| ; [Colin 
Carlberg, &; Couchman 1997| ), e = 1.0 ± 0.1. 


i.e. 
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It is possible (eq. [4]) to examine the evolution of the correlation function, using the growth 
factor g(z,fl o,A) from linear theory of gravitational instability ( Peebles 198C| , 1993); however, 
because of the uncertainties in our determination of the evolutionary parameter e and our lack 
of understanding of the exact nature of the heavy-element absorbers, it is not possible to derive 
values for the cosmological parameters Do and A from this work. A fruitful approach in the future 
may be more detailed numerical simulations of the heavy-element absorber network, like those of 
Zhang et al. (1998) for the Lya forest. In addition, results from the upcoming Sloan Digital Sky 
Survey will add greatly to our understanding of the absorbers. 


7. Summary 


We summarize our results: 

1. We have analyzed the clustering of C iv and Mg n absorption-line systems on comoving 

scales r from 1 to 16 h~ l Mpc, using an extensive catalog of heavy-element QSO absorbers with 
mean redshift (z) c i V = 2.2 and (-s)Mg n = 0-9- We find that, for the C iv sample as a whole, the 
absorber line-of-sight correlation function is well-fit by a power law of the form £ aa (r) = (?"o/ r ) 7 > 
with maximum-likelihood values of 7 = 1.75 ^070 an d comoving ro = 3.4 + / j -1 Mpc (qo = 0.5). 

2. The clustering of absorbers at high redshift is thus of a form that is consistent with that 
found for galaxies and clusters at low redshift, and of amplitude such that absorbers are correlated 
on scales of clusters of galaxies. It appears that the absorbers are tracing the large-scale structure 
seen in the distribution of galaxies and clusters, and are doing so at high redshift. The finding 
strengthens the case for using absorbers in probing large-scale structure. 

3. We also trace the evolution of the mean amplitude £o(~) of the correlation function, 
as a function of redshift, from z = 3 to z = 0.9. We find that, when parametrized in the 
conventional manner as £o( 2 ) oc (1 + ,z)~( 3+e ) +7 , the amplitude grows with decreasing redshift, 
with maximum-likelihood value for the evolutionary parameter of e = 2.05 ±1.0 (go = 0.5). 

4. When extrapolated to zero redshift, the amplitude of the correlation function implies that 
the correlation length ro = 30 ^3 h^ 1 Mpc (qo = 0.5), suggesting that strong C iv and Mg II 
absorbers, on megaparsec scales, are tracing the distribution of clusters of galaxies, or of the most 
luminous super-L* galaxies in those clusters. The scale of clustering suggests that these absorbers 
are biased tracers of the higher-density regions of space, and that agglomerations of strong 
absorbers along a line of sight are indicators of clusters and superclusters. This is supported by 
recent observations of Lyman break galaxies. 

5. The growth seen in the clustering of absorbers is consistent with gravitationally induced 
growth of perturbations. 
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A. Calculation of the Likelihood Function C. 


Recall that the correlation function £ aa (r,z) relates the observed density in 
absorber pairs, n(r,z), separated by comoving distance r at redshift z, with the 
in the number of pairs, no(r,z), from an unclustered absorber population at the 
and redshift: 


d 2 n 

drdz 


d?riQ 

drdz 


[1 + £aa(r,z)] • 


the number of 
expected density 
same separation 

(Al) 


The likelihood C is the probability of the data D , given a model (described by a set of 
parameters M ) for the correlation function £ aa (r, z): C = P(D\M). The data D consist of an 
observed set of absorber pairs i, each with comoving separation r* at redshift z t , contained in a 
small cell of size thy fo* —> 0. We include in the likelihood not only the probability P,( 1) that one 
pair was observed in the ith cell Sri Szt , but also the probability P k ( 0) that no pair was seen in the 
remaining empty cells Sr k Sz k : 


c = nw)xn^(o) 

i ky^i 


n 

i 

n 


d 2 


n 


drdz 

d 2 n 


Sri Szt exp — 


d 2 


n 


C n,zi) 


drdz 


Sri Szi I xfl exp 

( n,Zi ) / k^i 


d 2 


n 


drdz 


) 


Sr k Sz k 

(■Tfc,2fc) J 


drdz 


(• n,zi) 


Sri Szi x ]^[ exp — 


d 2 


n 


drdz 


Sr k Sz k 


(A2) 


(r*,,Zfc) 


In taking the limit as the cell size goes to zero, and substituting equation (Al) into 
equation (A2), we find: 


log £ 


lim > log 

Sri fei—>0 A - ' 





^!°g [i + U(:n, 


d 2 n$ 

drdz 


£ aa ( r j z) drdz + C , 


(A3) 


where the constant C is just a function of the data and has no dependence on the model 
parameters. We compute no(r,z) and the integral by generating a large number of simulated 
catalogs of unclustered absorbers, using a bootstrap Monte Carlo method described in § 4 and in 



QVY. The integral term is then the mean (£ a a}]VlC of the model correlation function, obtained by 
averaging £ aa (r, z) over all random pairs in the Monte Carlo simulation, with r and z falling in the 
domain of interest: 


log/: = log [1 + £ aa 0i, Zi)\ - (U)mc + c . 


(A4) 
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Fig. 1.— Line-of-sight correlation function, £ a a(r), for the entire sample of C iv absorbers, as a 
function of absorber comoving separation, r, in 4 logarithmic bins from 1 to 16 /i _1 Mpc. The 
vertical error bars through the data points are 1 a errors in the estimator for £ aa - Also shown 
is a power-law fit of the form £ aa (r) = (ro/r) 7 , with maximum-likelihood values 7 = 1.75 and 
comoving ro = 3.4 h _1 Mpc ( qo = 0.5). 

Fig. 2.— Credible region for the parameters in the power-law fit of the form £ aa (r) = (ro/r) 7 . The 
cross marks the maximum-likelihood location, at ro = 3.4 h Mpc (qo = 0.5) and 7 = 1.75. The 
heavy and light contours mark the 1 a and 2 a credible regions, respectively. 

Fig. 3.— Posterior distribution for the power-law index 7 . Vertical lines mark the 1 a credible 
region, 1.05 < 7 < 2.25. 

Fig. 4.— Posterior distribution for the comoving correlation length ro- Vertical lines mark the 1 a 
credible region, 2.4 /i _1 Mpc < ro < 4.1 /r _1 Mpc (q 0 = 0.5). 

Fig. 5.— Mean correlation function, £o(z), averaged over comoving scales r from 1 to 16 h Mpc, 
as a function of redshift. Shown are values for the low (1.2 < z < 2.0), medium (2.0 < z < 2.8), 
and high (2.8 < z < 3.6) redshift C IV sub-samples, as well as for the Mg 11 sample (0.3 < z < 1.6). 
The solid line is a maximum-likelihood fit of the form £ 0 ( 2 ) oc (1 + z) _ ( 3+e ) +7 , with e = 2.05 and 
7 = 1.75 (qo = 0.5). The dashed line is the best fit when only the C IV data are used (see text). 

Fig. 6 .— Credible region for the parameters in the evolving power law fit of the form £ aa (r, z) = 
(ro/r ) 7 . We have fixed 7 = 1.75 and zq = 2.2, and qo = 0.5. The cross marks the 

maximum-likelihood location, at e = 2.05 and ro = 3.2 /i _1 Mpc. The heavy and light contours 
mark the 1 a and 2 a credible regions, respectively. 

Fig. 7.— Posterior distribution for the comoving correlation length ro at redshift zq = 2.2. Vertical 
lines mark the 1 cr credible region, 2.6 h _1 Mpc < ro < 3.8 h~ l Mpc (qo = 0.5). 

Fig. 8 .— Posterior distribution for the evolutionary parameter e. Vertical lines mark the 1 a 
credible region, 1.05 < e < 3.05 (qo = 0.5). 
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